Total Variation Distance (TVD, or variation distance or statistical distance)
= the maximum absolute difference in probabilities assigned to any single event by the two distributions.
range in [0, inf/diameter of the support of the distributions]
useful for comparing distributions with complex shapes or high-dimensional data
Hellinger Distance
= the similarity between two probability distributions by comparing the square root of the difference between the square roots of the probability densities.
range in [0, 1]
sensitive to differences in the tails of distributions
= a symmetrized version of KL divergence that measures the similarity between two distributions by averaging their KL divergences from the average of the two distributions.